Rank in Wordlist | Frequency | Word |
---|---|---|
2272 | 5 | 10,000 |
2302 | 5 | 2,000 |
3783 | 3 | 1,000 |
3840 | 3 | 3,000 |
5499 | 2 | 1,200 |
5500 | 2 | 1,500 |
5503 | 2 | 10,000,000 |
5504 | 2 | 100,000 |
5599 | 2 | 20,000 |
5600 | 2 | 200,000 |
Rank in Wordlist | Frequency | Word |
---|---|---|
12568 | 1 | Debonaire(778 |
12963 | 1 | Emperor(814 |
13415 | 1 | Fu(Pinginisce:Dù |
16371 | 1 | Panzer(kampfwagen |
17167 | 1 | Scōle(1922-1926 |
17267 | 1 | Showdown'(ēac |
26466 | 1 | oneardendum(swa |
Rank in Wordlist | Frequency | Word |
---|---|---|
12989 | 1 | Englisce)᛬ |
15368 | 1 | Manitoba). |
22170 | 1 | frēo)--and |
22363 | 1 | fēlend)-seonu-swefn |
23851 | 1 | gēares). |
Rank in Wordlist | Frequency | Word |
---|---|---|
5497 | 2 | 0.24% |
5620 | 2 | 3.5% |
5663 | 2 | 70% |
5679 | 2 | 97% |
9928 | 1 | 1.22% |
9932 | 1 | 10% |
9975 | 1 | 11% |
10034 | 1 | 13.3% |
10060 | 1 | 14.7% |
10061 | 1 | 14.72% |
Rank in Wordlist | Frequency | Word |
---|---|---|
11854 | 1 | C&T |
16677 | 1 | R&D |
Rank in Wordlist | Frequency | Word |
---|---|---|
9889 | 1 | $1.50 |
9890 | 1 | $12,586 |
9891 | 1 | $124,000,000 |
9892 | 1 | $19,821 |
9893 | 1 | $22,563 |
9894 | 1 | $25 |
9895 | 1 | $29,688 |
9896 | 1 | $35,417 |
Rank in Wordlist | Frequency | Word |
---|---|---|
930 | 12 | Women's |
1464 | 8 | Women's Professional Soccer |
5723 | 2 | America's |
5868 | 2 | Byrne's |
6337 | 2 | Hallowe'en |
6577 | 2 | Loch a' Chàirn Bhàin |
7738 | 2 | d'Or |
10411 | 1 | 26°23'–28°08'W |
10601 | 1 | 5'6 |
10634 | 1 | 56°18'–59°27'S |
Rank in Wordlist | Frequency | Word |
---|---|---|
8399 | 2 | http://imagespoetry |
9931 | 1 | 1/3 |
9946 | 1 | 1043/4 |
10281 | 1 | 1949/1950 |
10818 | 1 | 9/11 |
12620 | 1 | Dheeb/Deeb |
12755 | 1 | Dweorfhunigbēo/lȳtlu |
13821 | 1 | Girl/Natural |
15183 | 1 | Ludwigshæfen/Rīne |
18117 | 1 | U1/U3 |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $
" ' + * = / _
Depending on the language some of these characters may be allowed within words, other will not. If words with forbidden characters do not have very low frequency there might be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots